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Abstract. Assume X„ is a random sample of n uniform, independent 
points from a triangle T. The longest convex chain, Y , of Xn is defined 
naturally (see the next paragraph). The length \Y\ of y is a random 
variable, denoted by Ln. In this paper we determine the order of mag- 
nitude of the expectation of L„. We show further that L„ is highly 
concentrated around its mean, and that the longest convex chains have 
a limit shape. 



1. Introduction and results 

Let T C be a triangle with vertices Po-,pi-,P2 and let X C T be a 
finite point set. A subset y C A is a convex chain in T (from po to P2) 
if the convex hull of y U {pO)P2} is a convex polygon with exactly |y| + 2 
vertices. A convex chain Y gives rise to the polygonal path C{Y) which 
is the boundary of this convex polygon minus the edge between pQ and p2- 
The length of the convex chain Y is just \Y\. 

For most part of this paper we assume that X = A„ is a random sample 
of n random, uniform, independent points from T. Let L„ be the length of 
a longest convex chain in A„. The random variable L„ is a distant relative 
of the "longest increasing subsequence" problem, cf. [Ij. In this paper we 
establish several properties of L„. The first concerns its expectcition, lEZ^^^. 

Theorem 1.1. There exists a positive constant a for which 

lim -—-=r- = a . 



Theorem II. H together with some geometric arguments based on Theo- 
rem 12.11 below, implies that the longest convex chains have a limit shape 
r in the following sense. Let C(A„) be the collection of all longest convex 
chains from A„. For every e > 

lim P(dist(C7(y),r) > e for some Y G C(A„)) = 0, 

n— >oo 

where dist(., .) stands for the Hausdorff distance. In fact, the statement of 
Theorem 11.31 is much stronger, because there e also converges to 0. The 
limit shape turns out to be the unique parabola arc F C T that is tangent 
to the sides PqPi at po and piP2 at p2> see Figure [Da). The parabola arc F 
will be called the special parabola in T. 

The proof of the 'limit shape' result is based on the following theorem, 
saying that L„ is highly concentrated around its expectation. 
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Theorem 1.2. For every 7 > there exists a constant N, such that for 
every n > N 

P(|L„ - EL„\ > n^/^) < n~^'/i^ 

For the quantitative version of the hmit shape theorem we fix our triangle 
T as T = conv{(0, 1), (0, 0), (1, 0)}. 

Theorem 1.3. Lei 7 > 1 and define e = 8/27^/2 ^"i/i2(-^Qg ^)i/4^ y/^g^ 
there exists N > 0, depending on 7, such that for every n > N , 

P(dist(C(y),r) > e for some Y G C(X„)) < 2n"^'/i^. 

2. Preliminaries 

When choosing one random point in triangle T, the underlying probability 
measure is the normalized Lebesgue measure on T. Most of the random 
variables treated in this paper (e.g. Ln) are defined on the nth power of 
this probability space, to be denoted by T®". In this case P denotes the nth 
power of the normalized Lebesgue measure on T. 

Throughout the paper, A stands for the (Lebesgue) area measure on the 
plane. So when choosing n independent random points in T, the number 
of points in any domain D C T is a binomial random variable of distri- 
bution B{n, A{D)/ A{T)). Hence the expected number of points in D is 
nA{D)/A{T). 

For binomial random variables we have the following useful deviation 
estimates, which are relatives of Chernoff's inequality, see [2], Theorems 
A. 1.12 and A. 1.13, pp 267-268. If K has binomial distribution with mean 
value k > 1 and c > 0, then 

(2.1) F{K < k-c^/k\ogk) < k'"^/^. 
On the other hand, for c > 1, 

(2.2) F{K > ck) < (^^y'' . 
We will use (|2.ip often, mainly with c = 1. 




Figure 1. The special parabola 
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The special parabola arc F in T is characterized by the fact that it has the 
largest affine length among all convex curves connecting po and p2 within 
T. (For the definition and properties of affine arc length see [6] or [3].) This 
is a consequence of the following theorem from [6]. Assume that a line i 
intersects the sides [poiPi] resp. [pi,P2] at points qo and q2- Let qi be a 
point on the segment [goi92] and write Ti resp. T2 for the triangle with 
vertices po,qo,qi resp. qi,q2,P2, see Figure El 

Theorem 2.1. [6j Under the above assumptions 

^A(lT)+ ^/A(7^< ^A(r)- 
Equality holds here if and only if qi € F and i is tangent to F at qi. 

The equality part of the theorem implies the following fact. Assume that 
Po = 90) Qi, ■ ■ ■ , Qk = P2 are points, in this order, on F. Let Tj be the triangle 
delimited by the tangents to F at qi^i and qi, and by the segment [qi-i,qi], 
i = 1, . . . , k; see Figure [Hb). 

Corollary 2.1. Under the previous assumptions Yli=i = ^A(T). 

In particular, when A(Tj) = t for each i = 1, . . . , k — 1 and A(Tfc) < t, then 
k-l< ^A{T)/t < k. 

We will need a strengthening of Theorem 12. li Assume go resp. q2 divides 
the segment [poiPi] resp. [pi,P2] in ratio a : (1 — a) and 6 : (1 — b), see 
Figure [H 




Figure 2. Characterization of F 

Theorem 2.2. With the above notation 

^/Wi) + ¥MT2) < ^/W)- ^/MT)\{a-bf. 

Proof. Let c be a number between and 1 so that qi divides the segment 
[^0) 0.2] in ratio c : (1 — c). Then, writing K[xyz) for the area of the triangle 
with vertices x,y,z, 

MPoqoqi) = aA{popiqi) = acA{popiq2) = abcA{popip2) , 

showing A(ri) = abcA{T). Similarly, A(r2) = (1 - a)(l - 6)(1 - c)A(r). 
Hence we have to prove the following fact: < a, 6, c < 1 implies 

(2.3) 1 - - ^(1 -a){l-b){l-c)>\{a- bf. 
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Denote Q the left hand side of (j2.3|) . By computing the derivative of Q with 
respect to c yields that for fixed a and 6, Q is minimal when 

yah 



^(l-a)(l-6) 

It is easy to see that with this c 



3, 



■\ 2/3 



Now, denote ( yob + y^(1 — a)(l — 6) j by 1 — n, so 



u = a + b-2ab- 2 Va6(r^^a)(r^^. 
We claim that u > {a — b)'^: this is the same as 



a - + 6 - 6^ > 2^(0 - a2)(6 - 62)^ 

which is just the inequality between the arithmetic and geometric means for 
the numbers a — a , b-b^ >0. Therefore, using n < 1, 

Q>l-{l-uy^^ >\u>Ua-bf. □ 

Theorems 12.11 and 12.21 imply the following 

Corollary 2.2. If qi € T and i is tangent to T at qi, then with the above 
notations, a = b. 

It is clear that the underlying triangle T can be chosen arbitrarily, since 
an affine transformation does not influence the value of -L„. Our standard 
model for T is the one with pQ = (0, 1), pi = (0, 0), p2 = (1, 0) as the vertices 
of T. In this case the special parabola T has equation ^/x + ^/y = 1. 

3. Other models 

There are several choices for the underlying finite set X. For instance, 
consider the lattice jZ^ where is the usual lattice in and t > is 
large, and set X = T n jZ^. Clearly, n := \X\ A{T)t^ as t ^ 00. Write 
Yn G X for a longest convex chain in T. It is shown in [5j that, as t — > 00 
(or n — > 00), 

(3-1) \yn\ = + oil)) = J^n^Hl + oil)). 

This result is analogous to Theorem ll.il except that in the lattice case the 
value of the constant is known to be 6/(27r)2/^, while in the present paper 
only the existence of the limit a is shown, together with 1.5 < a < 3.5, see 
Section HI This is similar to the longest increasing subsequence problem, 
[T], where it is easy to see that the expectation is of order -^/n, but proving 
the precise asymptotic formula 2-y/n(l + o(l)) turned out to be difficult, cf. 
[8] and [12]. In our case, numerical experiments suggest that a = 3 and we 
venture to conjecture that this is the actual value of a. 

More generally, let K C M? he a convex compact set with nonempty 
interior, and set Xt = Kdjl?. A set Y C Xtis said to be in convex position 
if no point of Y lies in the convex hull of the others. In other words, the 
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convex polygon conv Y has exactly |y| vertices. Let It be a maximum size 
subset of Xt which is in convex position and set m{K,t) = \Yt\. It is shown 
in [5] that 

(3.2) m{K,t) = ^-A_A*(i^)t2/3(i + o(i)) 

where A*{K) denotes the supremum (actually, maximum) of the affine 
perimeter that a convex subset of K can have. The main difficulty lies 
in the case of triangles, that is, proving (13. ip . 

These results can be extended, quite easily, to the present case when X„ 
is a random sample of n uniform independent points from K. For instance, 
writing Yn for the maximum size subset of Xn in convex position, one can 
show the following. 

Theorem 3.1. Under the above conditions 

hm n-V3E|y„| = 

Here a is the constant from Theorem 

One can also prove that conv Yn has a limit shape, namely, the unique 
convex subset of K whose affine perimeter is equal to A*(K). The proofs 
are almost identical to those used in [5], so we do not repeat them here, 
instead we rather explain what is different and more interesting. 

Another random model is when X comes from a homogeneous planar 
Poisson process X{n) of intensity n/A(T). Given a domain D in the plane, 
m{D) = \X[n) n Z)|, the number of points in D, has Poisson distribution 
with parameter A = nA{D)/A(T), i.e. 

F{m{D) = k) = e-^\^/k\ . 

We can also think of the Poisson model as follows: for a domain D, 
we first pick a random number m according to the corresponding Poisson 
distribution, and then choose m random, independent, uniform points in D. 
The advantage of the Poisson model is that the number of points of X{n) 
in disjoint domains are independent random variables, unlike in the uniform 
model. 

As is well known, the uniform model X„ and the Poisson model X{n) 
behave very similarly. In particular, Theorems 11.11 11.21 and 11.31 remain 
valid for the Poisson model as well, with essentially the same quantitative 
estimates. The proofs are quite standard, and we do not go into the details. 
Actually, the proof of Theorem 11.31 is simpler in the Poisson model since 
there the subtriangles behave the same way as any other triangle. 

The longest increasing subsequence problem has been almost completely 
solved by now, see [Ij. In this respect, our results only constitute the first, 
and perhaps the simplest, steps in understanding the random variable L„. 



4. Expectation 



The main target of this section is to prove of Theorem II. 1[ We also 
establish upper and lower bounds for the constant involved. 
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Proof of Theorem ll.il We start with an upper bound on EL„: 

EL 

(4.1) lim sup < v/2e = 3.4248 .... 

It is shown in [3], equation (5.3) (cf. [5] as well) that the probability of k 
uniform independent random points in T forming a convex chain is 

2fc 



Therefore the probability that a convex chain of length k exists is at most 
(^)2*^/(fe! {k + 1)!). In other words 



P(i^n >k)< 



n 



k) A;!(A; + 1)! ' 



We use this estimate and Stirling's formula to bound EL„. Assume 7 > \f2e. 
Then 

n n 

ELn = Y^HLn >k)< J]P(L„ > k) 

k=0 k=0 

<7v^+ ^iLn>k) 

fc>7 



1 /2e^^'' 



<7^ + n-V2c7, 

where C = 7^/ (7^ — 2e^) is a positive constant. Since this holds for arbitrary 
■y > \/2e, (jiTTI) is proved. 

Next we establish a lower bound for EL„. We use the second half of 
Corollary 12.11 with t = 2A(T)/n. So we have triangles Tj of area t for 
1 < i < k — 1, and the last triangle Tk of area less than t. By ()2.ip 
/c > -y/n/2. Let X„ be the uniform independent sample from T. Let be 
a point of Tj n provided that Tj n Xn / 0. The collection of such Xj's 
forms a convex chain. Hence the expected length of the longest convex chain 
is at least the expected number of non-empty triangles Tj, so 

EL„ > ^P(T,nX„/0) > (A: - 1) (^1 - (^1 - ^ 
> \ - - A (1- e'^) w 0.6862 n^/^. 
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What we have proved so far is that 

a = hminf n"^/^ELn > 0.6862, and a = lim sup n"^/^EL„ < 3.4249. 

We show next that the Umit exists. Suppose on the contrary that a<a. 

The idea of the proof is to use the second half of Corollary 12.11 again, 
with the longest convex chain in the small triangles having length close to 
the limsup , while in the large triangle, EL„ is close to the liminf. For 
convenience, we suppose that A{T) = 1. 

Choose a large n with EL„ > (1 — e) a-y/n, and an N much larger than 
n with ELtv < (1 + £)a\^. Here e is a suitably small positive number. 
Define ni so that the equation n = ni — \/ni log rii holds. 

Choose uniform, independent random points from triangle T. Define 
t = ni/N. Hence the expected number of points in a triangle (contained in 
T) of area t is ni. 

Apply the second half of Corollary 12.11 with this t. Then the number of 
triangles, k, satisfies k > yN/ni. 

Denote by ki the number of points in Tj, and by EL* the expectation 
of the length of the longest convex chain in Tj. Clearly ki has binomial 
distribution with mean ni, except for the last triangle where the mean is 
less than ni. 

Since the union of convex chains in the triangles Tj is a convex chain in 
T between (0,0) and (1, 1), by estimate (|2.1|) we have 

ELAr>^EL* > ^ ¥{ki>n)W.Ln 

i<k i<k—l 
i<k-l 

> (^iV/^ " " ^^^^^) (1 " 

= a ^(1 - e) (l - % ^/^) (vW^ - 

>a\^{l-2e), 

where the last inequality holds if n is chosen large enough and N is chosen 
even larger with n/N very small. Thus (1 + e) a > (1 — 2e)a which, for 
small enough e, contradicts our assumption a<a. □ 

Remark. The lower bound EL„ > 0.6862 n^/'^ is probably the easiest to 
prove. A better estimate, also mentioned by Enriquez [7], can be established 
as follows. Assume T is the standard triangle and let D denote the domain 
of T lying above F. Then A(D) = 1/3, so the expected number of points in 
D is 2n/3, and the number of points is concentrated around this expectation. 
The afRne perimeter of D is 2 \/l/2 (see [3] ) , and a classical result of Renyi 
and Sulanke [9] yields that expected number of vertices of conv(L> n Xn) is 
about 

^{l)\fl{l) '^'2{/^^^« 1.5772 ^ 
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Since most vertices are located next to the parabola, the majority of them 
form a convex chain, and so 

(4.2) liminf^^ > 1.5772... . 



This sketch can be completed with standard tools. From now on, we will use 
this estimate. Also, a will always refer to the limit constant of Theorem ll.il 

5. Concentration results for EL„ 

The concentration results proved here are consequences of Talagrand's in- 
equality from [To] which says the following. Suppose y is a real-valued ran- 
dom variable on a product probability space O*^", and that Y is 1-Lipschitz 
with respect to the Hamming distance, meaning that 

\Y{x)-Y{y)\ < 1 

whenever x and y differ in one coordinates. Moreover assume that Y is 
f -certifiable. This means that there exists a function / : N ^ N with the 
following property: for every x and b with Y(x) > b there exists an index 
set / of at most f{b) elements, such that Y{y) > b holds for every y agree- 
ing with X on /. Let m denote the median of Y. Then for every s > we have 



P(y < m - s) < 2 exp 

and 

P(y > m + s) < 2 exp 



4/M 

2 



-S 



^4/(m + s)^ 

When applied to Ln, these inequalities prove concentration about the me- 
dian, to be denoted by m„. Theorem 11.21 concerns the mean of Ln- However, 
concentration ensures that the mean and the median are not far apart, in 
fact, lim n~^/^mn = a. First we need a lower bound on m^- 

Lemma 5.1. Suppose that logn > 25. Then 



irin > ySn/logn. 

Since this is a special case of Lemma 16.11 from the next section, the proof 
will be given there. 

Proof of Theorem 11.21 The statement cries out for the application of Ta- 
lagrand's inequality. The random variable L„ satisfies the conditions with 
f{b) = b, since fixing the coordinates of a maximal chain guarantees that the 
length will not decrease, and changing one coordinate changes the length of 
the maximal chain by at most one. Write m = rrin for the median in the 
present proof. Setting s = P^/n^ogrn where /3 is an arbitrary positive 
constant, we have 

I r, I — ; \ . f —(3'^mlogm 1 

F{\Ln -m\> f3^/mlogm) < 4exp <^ — —y==- ^ 

{4:[m + p\/miogm) ) 

—0^ log m 1 



4 exp 



4(1 + /3v^m-i log m 
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Define now Pq = cym/logm with a constant c > 0, which will be fixed at 
the end of the proof in order to give the correct estimate. If /? < /3o, then 
i3\Jmr^ log m < c, and the denominator in the exponent is at most 4(1 + c). 
Thus 



(5.1) P(|L„ -m\> /3v^mlogm) < 4m 

On the other hand, for (3 > (3q we have 



(5.2) P(|Ln -m\> l3\/m logm) < P(|-L„ - m| > [5q\/ mlogm) 

= 4exp ( —m 



c2 



4(1 + c) 

Next, we compare the median and the expectation of Ln- 

/•CO 

|EL„ — m| < E|L„ — m| = / IP(|-^^n — Tn\ > x)dx. 

Jo 

The range of L„ is [l,n], so the integrand is if x > n. Substitute x 
P\/m log m, and divide the integral into two parts at Pq: 



\ELn - m| < 4y^m logm(/i + 12), 

where 

(5.3) Ii = / m-^'/4(i+^)d/3 < / m-^'/^(i+'=)d/3 



and 







/7r(l +c) 
logm ' 



pn/^mlogm / C \ / C \ 

(5.4) /2 = y^^ exp^-m^^Jd/3<nexp(^-m^^J. 

By Lemma [5T| n < m^, so I2 < m^exp(— mc^/4(l + c)). Since m„ goes to 
infinity as n increases (again by Lemma l5. ID . the bound on I2 is eventually 
much smaller than the one on Ii: 



|EL„ — m| < 4ymlogrTi(/i + I2) 
(5.5) < 4y^ 7r(l + c)m + A^/ m log m exp ^— m ^ ^ 

< 5V7r(l + c) 



m 

for all large enough n. Hence EL„ is of the same order of magnitude as m„, 
and we obtain 

(5.6) lim n"^/^EL„ = lim n~^/^m„ = a. 

For fixed 7 and for large enough n, (15.50 implies 



1/6^1 



P(|L„-EL„| >7vTog^n 

< P(|L„ - m| > 7V^logn n^/^ - |EL„ - m| ) 

< P(|L„ - m| > 7 A/log n n^/^ - 5 V7r(l + c)^/^' 
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Using m„ < 3.43n^/^ from ()4.1|) and (j5.6p . it is easy to see that 



/i 1/6 rrrr^^ — ^ ^ /31ogm-log41 5v^7r(l + c) 

7Vlogn n - 5V7r(l + c) m > 7^"^ ( \/ ^-^^ — 




Since for large enough n, ^y^3/3AA < /3o = cy^ni/ log m, ()5.ip finally implies 
P(|L„-EL„| >7v/i^ni/6) 

< P(|L„ -m\> 7^ ^^\/"ilogm) 

< 4m-3^'/i3.76{i+c) < ^-7^14 

with (|5.6p and the choice of c = 0.01. □ 

Remark. The constant in the exponent is far from being best possible. We 
have made no attempt to find its optimal value. In general, Talagrand's 
inequality is too general to give the precise concentration, see Talagrand's 
comments on this in |10j . 

6. SUBTRIANGLES 

For the proof of Theorem 11.31 we need to consider subtriangles S of T, 
that is, triangles of the form S = conv {a, b, c} with a,b,c £ T, while Xn is 
still a random sample from T. We will need to estimate the concentration of 
the longest convex chain from Xn in S. Since this random variable depends 
only on the relative area of S, we may and do assume that T is the standard 
triangle and S = conv{(0, ^), (0, 0), {^/s, 0)}. Thus A(5) = s/2. Write Ls,n 
for the length of the longest convex chain in S from (0, ^/s) to {y/s, 0), and 
nis^n for its median. In the following statements, we consider the situation 
when sn/2, the expected number of points from Xn in S, tends to infinity. 

As in the proof of Theorem ll.2[ we need two estimates: a lower bound for 
the median guarantees that the mean and the median are close to each other, 
while an upper bound for the expectation (or for the median) is needed for 
deriving the inequality in terms of n. Here comes the lower bound; the case 
s = 1 is Lemma |5. 11 

Lemma 6.1. Suppose that log(ns) > 25. Then 



> V3ns/log( ns). 

Proof. Set t = (A(5) log(ns))/(3ns), and apply the second half of Corol- 
lary [2T] to the triangle S. The number of triangles is k with 



\/3ns/ log (ns) < k < -^3ns/log(ns) + 1. 
For any i G {1, . . . ,k}, the probability that Tj contains no point of Xn is 

log(ns) ~ 



p(r, n x„ = 0) < 1 



3ns 

og( 
3s 



< exp ( ) = (ns)-V3^ < (ns)-V3 
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Hence the union bound yields 



r{Ln,s > ^/ 3ns /logins)) > 1 - F{Ti n X„ = for some i < k) 

> 1 - fc(ns)~^/3 

> 1- (^3/log(ns) + (ns)-i/3), 

which is greater than 1/2 by the assumption. □ 

Obtaining an upper bound for the mean is shghtly more dehcate; note 
that in the Lemma below s need not be fixed. 

Lemma 6.2. Assume ns — > oo. Then 

lim(ns)~^/^ELs,„ = a 

where a is the same constant as in Theorem\l.l\ 



Proof. Take any e > and choose A'^o (depending on e) so large that for 
every k > Nq, (1 — e)a < EL^, k~^^^ < (1 + e)a. The random variable 
K = \Xn n 5*1 has binomial distribution with mean ns. When ns is large 
enough, ns — \/ns log ns > Nq, and we use (|2.ip for a lower estimate: 

n 

ELs,n = Y,nK = k)ELk 

k=0 



> ¥{K > ns — \J ns log ns)(\ — e) a [ns — \J nslogns)^^"^ 

> (1 - (ns)~^/^)(l - e)a (ns - y/ns logns)^/^ 

> (1 - 2e)a(ns)^/^ 

For the upper bound, Jensen's inequality applied to -y/x comes in handy: 



EL,^n = Y.^{K = k)ELk 

k=0 

n 

< No F{K <No)+ F{K = k)ELk 



k=No 



<No + F{K = fc) (1 + e) a 

k=No 



<No + ¥{K >No){l + e)a 



n 



\K = k) 
J^^^¥iK>No) 




< No + F{K > iVo)2/^(l + e)Q(Ei^)^/3 

<No + {l + e)a (ns)^/^ < (1 + 2e) a (ns)^/^ □ 

Next, we derive the strong concentration property of Lg^n, the analogue 
of Theorem 11.21 

Theorem 6.1. Suppose t is a constant with < r < 1. Then for every 
7 > there exists a constant N, such that for every n > N and every 

s > n~'^ , 

P(|L,,„ -EL,,„| > 7Vlogns (ns)^/^) < ins)-^'/^\ 
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Proof. This proof is almost identical with that of Theorem 11.21 Since Lg^n 
is a random variable on T®", we can apply Talagrand's inequality with the 
certificate function f{b) = b in the same way as in the proof of Theorem II .21 
Write again m for 

TTis^ri,^ the median of Lg^^. Define /5o — ui j log vn with 
c = 0.01, then the estimates (|5.ip and (j5.2p remain valid with L^^n in place 
of Ln- Just as before, 

/•oo 

|ELs,n — 'm\< E|Ls^„ — m\ = \ P(|Ls^„ — m\> x)dx 

Jo 

= Ay/ m log m{Ii + I2) 

where /i and I2 are defined the same way as in (j5.3p and (j5.4p . Moreover, 
/i satisfies the inequality (|5.3p . With I2 we have to be a bit more careful. 

Note that s > n~'^ with r < 1 guarantees that Lemma 16.11 is applicable 
for n > exp(25/(l — r)). As x/logx is monotone increasing for x > e, 



3ns 3n^-r J n^~- (^,^^y, 

- y log(ns) - Y (1 - r) log n ^ V n(i-)/2 

for large enough n, and therefore by ()5.4p 
/2 < m^/d--) ' 



I2 < m ' ^ ' exp I — m 



4(1 + c) 



where of course 6/(1 — r) < 00. Lemma [6?T] implies that m = m^^^ — > cx), 
thus the bound on I2 is much smaller than the one on 1\ for large enough 
n. Therefore, just as in (j5.5p . 



|ELs^„ - m| < 4A/m logm(/i + 

< 4a/ 7r(l + c)m + 4 m log m rf^l^^~'^^ exp ( — m — — ) 

V 4(l+c)y 



< 5 V^(l + c) 



771. 



Hence EL^ „ is of the same order of magnitude as m = msn- Since sn > 



n 



l-T 



00, we can use Lemma 16.21 obtaining that for large enough n. 



(6.1) rrig^n < 3.431^^. 

Again for fixed 7 and for large enough n, 

P(|L,,„ - EL,,„| > 7\/logns (ns)^/^) 

< P(|L,,„ - m| > TV^logns (ns)^/^ - |EL,,„ - m|) 

< P(|L,,„ -m\> 7v/logns (ns)^/^ - 5V^(l + c)V^), 

and by (l6T^ . 



7V^logns (ns)^/^ - 5-v/7r(l + c)^/rn > 7^ 



m. 
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Since for large enough n, 7y^3/3.44 < (3o = cy^m/ log m, (jS.ip applied to 
Lg^n and (|6.ip finally implies 



< F{\Ls,n -m\> 7^ ^^\/"ilogm) 

< 4m-37Vl3.76{l+c) < („^)-7Vl4. g 

Remark. The proof also yields that for any < A < B < oo, there exists 
A'^ (depending on A and B only), such that the inequality of Theorem 16.11 
holds for any 7 G -B] and for every n > N. 

7. Geometric lemmas 

For the proof of Theorem 11.31 we need further preparations. We start by 
assuming that K is a convex compact set in the plane and A{K) > 0, and 
Xn is a random sample of n uniform and independent points from K. We 
need to estimate the probability that X„ is in convex position, that is, no 
point of Xn is contained in the convex hull of the others. We denote this 
probability by convex in K). 

Lemma 7.1. If K is as above, 

~ /240 
F{Xri convex in i^) < 

\ 

Proof. Let P be the smallest area parallelogram containing K. As is well 
known, A{P') < 2A{K). Let X* be a random sample of n uniform and 
independent points from P. In this case a (surprisingly exact) result of 
Valtr |llj says that 

(2r) — 2^ ^ 
1 

Now we have 

F{Xn convex in K) = ¥{X* convex mP\X*cK) 

_ F{X* convex in P and X* C K) 

~ nx* c K) 

F{X* convex in P) 



< 



Fix* C K) 



where the last step is a straightforward estimate. □ 

^From now on we work exclusively with the standard triangle T. 

Assume next that K \s a, convex subset of the triangle T, and let Xn be 
random sample of n uniform and independent points from T. We define 
M(K, n) as the random variable 

M{K, n) = max{|y| : Y C Xn H X is in convex position}. 
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^From Theorem 13. II it is not hard to determine what the asymptotic expec- 
tation of M{K, n) is. But what we need is that M(K, n) is large with small 
probability. This is the content of the next lemma. 

Lemma 7.2. Let K be a convex subset ofT. Then for any positive integers 
n and fi satisfying 1920 ^i^) ^ z^^^; 

Proof. If M{K, n) > /x, then Kr\Xn contains a subset of size which is in 
convex position. Lemma l7. II and the union bound imply that 

P(M,i..n),.||KnX„|^.),Q(?^)%(^)". 

The random variable \K n X„| has binomial distribution. Thus we have 
¥{M{K,n) > n) 



J2 r{M{K, n)>fi\\KnXn\=k)r ] {2A{K))''{1 - 2A{K)) 
Emm {l, (^)''} Q(2W(1 - 2Am)- 



n~k 



< 

k=fj, 



fc<fco k=ko 

Here we choose to be equal to /x^/(480e). Then 

fc<fco k<k() ^ ^ ^ 

Since {1){2A{K))''{1 - 2A{K))'^^'' is decreasing for k > 2A{K)n, and the 
condition on fi guarantees that ko > 2A{K)n, 



^[..] < n(^"J(2A(i^))'="(l-2^(K)) 



n~ko 



k>ko 

V / \ ko 

< n2-'=« =n2-^'/(480^). 

□ 

For the proof of Theorem 11.31 we will consider other parabolas that are 
similar to F. Let F^ be the parabola defined by the equation ^/x + ^ = 
\/l + r where the parameter r G (—1,3). The graph of F^ is the homothetic 
copy of F with ratio of homothety 1 + r, and center of homothety at the 
origin, see Figure [3] a). Assume the point {a,b) is on F. Then the point 
((1 + r)a, (1 + r)b) is on F,., and the tangent line to this point on F^ is given 
by the equation 

^ + ^ = 1 + r. 
V« Vb 
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It follows that the distance between parallel tangent lines to T and F,. is 



(7.1) 



Define now 



< 



a ^ b 



V8- 



p = y/8s = 3\/27^/2^~^/^2(logn) 



1/4 



here e comes from Theorem 11.31 This definition immediately implies the 
following fact. 

Proposition 7.1. If a convex chain C{Y) lies between r_p and Tp, then 
dist{C{Y),T) < e. 

We need one more piece of preparation. Assume £ is a tangent to r^., 
at the point q. With the notations of Section [21 let Ti and T2 denote the 
two triangles determined by £ and see Figure E] a). Let X„ be a random 
sample of n points from T and let U denote the length of the longest convex 
chain in Tj, i = 1, 2. 





a) b) 
Figure 3. Convex chains far from F 



Lemma 7.3. For sufficiently large n, if \r\ > n then 

.2 3/ 



EL^ + EL^ < ELn - 0.52 r" <^n. 

Proof. Let ti = 2A(Tj) for i = 1,2. We want to apply Theorem [221 It is 
not hard to see (using Corollary 12.21 for instance) that what is denoted by 
|a — 6| there, is equal to |r| here. Consequently 

1 



(7.2) 



' 3 



Write U for the longest convex chain in the triangle Tj. By affine invari- 
ance U has the same distribution as L(j,n (from Section [6|) for i = 1, 2. We 
need to estimate EL„ — (EL-^ + EL^) from below. 

For four points go = (0,1), and 53 = (1,0) in this order on F, 

denote by Si the triangle delimited by the tangents to F at qi, and by 
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the segment [qi-i,qi], i = 1,2,3; see Figure[3]b). Choose qi and q2 so that 
A{Si) = ti/2 and A(S2) = t2/2. Then Corohary O and imply that 



1 2 



Let now A* denote the length of a longest chain in Si for i = 1,2,3. For 
i = 1 and 2, A* has the same distribution as Lt-,n (and as L*). Therefore 
EL* = ELt^^n = EA^ for i = 1, 2. Further, A^ + A^ + A^ < L„ follows from 
concatenating the longest convex chains in the triangles Si. Thus we have 

3 

(7.3) EL^ +EL2 + EA3 = ^EA^ < EL„. 

i=l 

The random variable iXonSal has binomial distribution with mean 2A{S3)n 
which is at least n = {l/Sfr^n > {l/Sfn^/"^. Set N = n - V^logK. Thus 
we obtain that for all large enough n, 

0.99 

and N tends to infinity with n. Using the estimates (|2.ip and ()4.2p . again 
for large n we have 

EA^ > F{\Xn n S'al > N)KLn > (1 - t'^/^) 1.57 atVs 
> 1.5697V^/^ > 0.52 r^^. 

Hence, by 

EL^ + EL^ < ELn - 0.52 

□ 



N > 0.99 k; = ^^r^n, 



8. Limit shape 

After the preparations in the previous sections we finally prove Theorem 
11.31 that is, all chains in C lie in a small neighbourhood of F with high prob- 
ability. Note that similar limit shape results have been proved for convex 
chains [4j; however, they are of different character than the present case. 

We fix the constant 7 > 1. Every result in this chapter holds for large 
enough n, depending only on 7. We will not always mention this. 

For this proof we set b = 'y'n^^^^/logn. The strong concentration result of 
Theorem 11.21 directly shows that 

P(L„ < ELn -b)< n"^"/^^. 

We call a convex chain Y C Xn long if its length is at least EL„ — b. 

We will show that all long convex chains lie between the parabolas Tp 
and F_p with high probability, where high means > 1 — n"'''^/^^. In view of 
Proposition 17.11 this suffices for the proof. 

Let S be the triangle with vertices (0,0.1), (0,0), (0.1,0), and define H to 
be the event that there is a long convex chain Y C Xn having a point in S. 
We prove first the following simple fact. 

Lemma 8.1. For n large enough, 

F{H) < n-T'/^ 
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Proof. Let y be a long convex chain with a point in 5, and let y be a point 
of Y where the tangent to C{Y) has slope 1. Clearly y £ S. Let Yi be the 
part of Y between (0, 1) and y, and Y2 be the part between y and (1, 0). Then 
Yi resp. Y2 are convex chains in the triangle 5i = conv{(0, 1), (0, 0), (0.1, 0)} 
and ^2 = conv{(0, 0.1), (0, 0), (1, 0)}. As y is a long convex chain, 

ELn - 6 < |y| < |yi| + 1^2! < + 

where U denotes the length of the maximal chain in Si {i = 1,2), \Yi\ < U. 
As n — > 00, the limit of n~-^/^EL„ resp. n~"^/^EL* is a and a v^O.l. This 
follows from Theorem [LT] and Lemma[621 So limn~^/^(EL„ — EL^— EL^) = 
a(l — 2-v^O.l) > 1/10, implying that for large enough n 

EL„ - EL^ - EL^ > > 36 = 87^^/^ y^logn. 

So we have 

P(F) < P(l1 + > EL„ - h) 

= P(L^ + > EL^ + EL^ + (EL„ - EL^ - EL^) - h) 
< F{L^ + > EL^ + EL^ + 26) < ^ P(L'' > KL' + b). 

i=l,2 

The triangle Si is of area 1/20 so Theorem 16.11 shows that 

¥{V > EV + b)= P(L* > EL* + 7n^/Vlogn) 
< F{L' > EL* + 7 20^/^(n/20)^/^A/iogn720) 

- V20/ - 2 

After this first step, we estimate the probability of the existence of a long 
convex chain not lying between r_p and Tp. First, we deal with the case 
when the chain goes below this region. 

We define a set of parabolas. Let A = n^^/'^ylogn, = —p — iA, and 

(8.1) Gi = Fj.- where i = —1, 0,1, ... ,g. 

Note that < 0. Here we define g by the conditions Gg <Z S but Gj_i is not 
contained in S. Thus the case when a long chain goes below Gg is covered 
by Lemma l8.ll Clearly g is limited by —1 < rg = —p — gA > — 1 + 1/10. 
Thus g < n^/'^, say. 

The convex polygonal chains C{Y) can be considered as functions defined 
on [0, 1]. We extend the definition of F,. as on the interval [1 + r, 1] if r < 0, 
and consider this new "parabola" F,, as a function defined on [0,1]. A 
parabola is said to be below, resp. above C{Y) if the corresponding function 
is smaller (larger) than the one corresponding to C{Y). 

The following lemma is important. 

Lemma 8.2. There are points qij E j = 1,2, J(i) with J{i) < 

n^/^, such that the upper envelope of the tangent lines i{qij) of Gi-i at qij 
is a broken polygonal path lying above Gi . 
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Proof. The line which is tangent to Gi-i at q E intersects the 

graph of Gi in two points. Let Xq denote the segment connecting these 
two points. It is not hard to check that the length of the segment, |A((7)|, 
decreases as q moves away from the center point of Gi^i. A simple compu- 
tation reveals that 

(8.2) AA^^^^ < |A,| < x/2A(l + r,), 

where q only moves up to the point when both endpoints of X{q) lie in Gi. 

Now choose qi^i on Gj_i so that the lower endpoint of X{qi^i) is the inter- 
section of Gi with the x-axis. Once qij has been defined, we let qij+i be 
the point in Gi-i for which the lower endpoint of X{qij^i) coincides with 
the upper endpoint of X{qij), see Figure H] a). The length of Fj is smaller 
than 2(1 + rj). So the process of choosing the qij stops after 

' ^ 4A(l + r,)2 -2A(l + r,)- 

steps. This finishes the construction of the points qi^j. The upper envelope 
of the tangent lines i{qij) is a convex polygonal path that lies between Gi 
and Gi-i with edges X{qij). □ 




Figure 4. Long chains below F 

Now we define G* to be the event that there is a long convex chain Y C Xn 
with Gi+i below G{Y) but Gi not below C{Y), i = 0,1, . . . , g - I. 

We split these events further. Let G* ^ be the event that there is a long 
convex chain Y with the parabola Gj+i below C(Y) but the line l{qij) not 
below C(Y); here qij £ Gi-i comes from Lemma 18.21 This implies that 
G* C UjeJ(j) Gij- 

Lemma 8.3. For every i = 0, . . . , g — 1 and every j = 1, . . . J{i), P(G*j) < 
Before the proof we state (and prove) the following corollary. 
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Corollary 8.1. The probability that there is a long convex chain Y C X„ 
such that C'{Y) is not above r_p is at most n~'^ + 3n~'^ Z"^. 

This is quite easy: If there is such a chain, then either H, or some G* 
(z = 0, 1, ... ,5 - 1) occur. Since G* C Uj6J(i) ^ij' dJii) < n and 7 > 1, 
the corollary follows from Lemmas 18.31 and 18.11 

Proof of Lemma 18.31 Let Ti, T2 be the two triangles determined by qij and 
£{qij) as usual, and let K = Kij be the convex set between X{qij) and Gj+i, 
see Figure Hlb). 

We estimate A{K) as follows. A simple calculation as in ()8.2|) yields that 
the diameter of K is at most 2^/A, and K is between the line iiqij) and the 
parallel line tangent to Fj+i. The distance of these lines is at most 2A/^/8 
as one can easily check using dH]). Then A{K) < V2A^/^. 

A long convex chain Y C X„ which is above Gj+i but not above i{qij) 
splits into 3 parts: Yi = Ti n Y, Fa = T2 n Y, and Y^ = K n Y. Here Yi, H 
are convex chains in Ti (from (0, 1) to qij) and in T2 (from qij to (1,0)), 
and Y3 is in convex position in K. So with the notations of the previous 
section we have 

\Yi\ < L\ \Y2\ < L^ and \Ys\ < M{K,n). 

Since y is a long convex chain, lYil + II2I + |^| ^ ]EL„ — b. This implies 
that + L'^ + M{K, n) > EL„ — b. We are going to show that this event 
has small probability. 

We apply Lemma 17.21 with /x = 6/5. For large enough n it implies that 

(8.3) F{M{K,n) > 6/5) < (6/5)^2-^/5 + n2-'''/(480e53) ^ ^-n^/^ ^ n'^^'/^ 

since the condition 1920 A{K)n < {b/5f is satisfied as A{K) < V2A^/'^ < 
V2n-V2(logn)3/4 and (6/5)^ = -f^n^/^{lognf/^ /125. 
Next, 

F{L^ + + M{K, n) > EL„ - 6) 

(8.4) < P(L^ + > ELn - 1.2 6) + F{M{K, n) > 6/5) 

< P(L^ + > ELn - 1.2 6) + n-^^'/\ 

Now Lemma ED implies that EL^ + EL^ < ELn - 0.52rf_^-^, and hence 
P(L^ > ELn - 1-2 6) 

(8.5) < F{L^ + > EL^ + EL^ + 0.52rf_i ^ - 1.2 6) 

< P(L* > EL' + 0.26rf_i ^ - 0.6 6) 

i=l,2 

< P(L* > EL* + 46). 

1=1,2 

Here the last step is justified by observing that rj_i < r_i = —p + A and 
so for large enough n 

0.26 rf ^ > 0.26 n^/^ {3V2-f^/^n-^/^^{log nf'^ - n-^l'^^\ognf 

(8.6) > 4.67 n^l^^JXogn = 4.6 6. 
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Next, we estimate P(L* > EL* + 46). When U = 2A{Ti) > n"^/^, we use 
Theorem 16. II with r = 5/6: 

P(L* > EL* + 46) = P(L* > EL* + 47v^logn n^/^) 

< P(L* > EL* + 47Vlogn/log(nti) x/log(nti) (nti)'/^) 

The last inequality holds because of the Remark following Theorem 16. H 
since 

1 < 47 Y^log n/ log{nti) < jAVQ. 

Finally, when tj < n^^/^, the expected number of points in Tj is tin < 
nV6. So for the random variable \Ti CiXnl inequality (j2.2p implies that 

V 47 V log ny 
for large enough n, and hence 

P(L* > EL* + 470^ n^/^) < n-^^'/^ 

Thus P(L* > EL* + 46) < n-^f^/'^ for i = 1, 2 in ah cases. □ 

Now we handle the case of parabolas going above Tp. Set Ri = p + iS 
where 5 = n~^/^-^/Iogn. We define another series of parabolas: 

(8.7) Gi = rn,, i = -1,0,1,..., f 

where / is limited by p + /5 < 3. Thus / < n^/^, say. 
The following geometric lemma is similar to Lemma 18.21 

Lemma 8.4. There are points pij G Gi-i, j = 1,2,... with J{i) < 

n^/^ such that the following holds. For each convex chain Y C X„ with Qi-\.i 
above C{Y) hut Qi not above C{Y), there is a pij such that the line i{pi,j) 
is below C{Y). 

Proof. For such a long chain Y there is a smallest R > p with F/j above 
C(y). Then C{Y) and F/j have a common point and a common tangent £ 
at that point (because both C{Y) and Fr are convex). Let p be the point 
on Qi such that the line l!.{p), tangent at p to Qi, is parallel with I. It is 
evident that C{Y) is above l{p). 

Let L denote the set of lines that are tangent to Qi and that have both 
(0,0) and (1, 1) above it. We will construct a set of points pi^j G Qi-i such 
that each line in L is above the segment i{pij)ri T for some j = 1,2, . . . , J{i). 
This construction then guarantees what the lemma requires. 

We need one more piece of notation. Given pij let [Aj,Bj\ be the segment 
T n i(pij), with Aj on the x-axis and Bj on the y-axis. We shall construct 
the sequence of the Aj^s and -Bj's. 

The construction starts with pi^i at the midpoint of Qi-i and we define 
first the other pij with Ai closer to the origin than Aj. See Figure [5] a). 
Assume pij has been found. There is a unique tangent, £, to Qi passing 
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through Bj. Let Aj^i be the intersection point of £ with the x-axis, and 
Pi,j+i the common point of Qi-i with the tangent to Gi-i through Aj^i. 
The construction is finished when we reach x{Aj) < 0, here x{Aj) denotes 
the x-coordinate of Aj. Corollary 12.21 imphes that 

\AjAj+i\ = \BjBj+i\ = (1 + R,) - (1 + Ri^i) = 6. 

Since x{Ai) < 1/2, we reach x{Aj) < after at most {25)~^ steps. 

The construction satisfies what we need: if a tangent to Qi intersects the 
triangle in the segment [A, B] with A on the x axis and x{A) G [0, 1/2], then 
A is between ^j+i and Aj for some j, and the segment -B] is above the 
segment i{pij) n T. 

The construction is extended to the other half of Qi-i symmetrically, and 
J{i) < 2(2(5)-^ < follows. □ 




Figure 5. Long chains reaching above F 

Next we define Q* {i = 0, 1, ...,/ — 1) to be the event that there is a 
long convex chain Y C X„ such that Qi+i is above CiY) but Qi is not above 
C(y), i = 0, 1, ...,/ — 1. Further, let Q* j be the event there is a long convex 
chain Y C with C(Y) below Qi^i but not below i{pij) (remember that 
Pi,j G Here z = 0, ...,/ — 1 and j = 1, . . . , J^{i)- We have now the 

following result, similar to Lemma 18.31 

Lemma 8.5. For every i = 0, ...,/ — 1 and every j = 1, . . . < 

This lemma immediately implies the following corollary. 

Corollary 8.2. The probability that there is a long convex chain Y C X„ 
such that C{Y) is not below Tp is at most 3n~'^ 1'^ . 

The proof follows from the facts that Q* C Uje:7(i) ^i'.j' / ^ ^^^^ ^(^) ^ 
n^/^, and 7 > 1. Now we give the proof of Lemma 18.31 which is analogous 
to that of Lemma 18.31 
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Proof of Lemma |8.5[ Let i{p) be the unique tangent to Gj+i which is paral- 
lel with £(pij), and p be the common point of i{p) and Ti^i, see Figure [5] b). 
Let Ti,T2 be the two triangles determined by p and i{p), and let K = Kij 
be the part of T that lies between i{pij) and i{p)- Since the distance of 
these two lines is at most 25/\/8, A{K) < 6. 

A long convex chain Y C Xn which is below but not below i{pij) 
splits into 3 parts: Yi = Ti n Y, = ^2 n Y, and Y3 = KnY. Here Yi, H 
are convex chains in Ti (from (0, 1) to p) and in T2 (from p to (1, 0)), and 
Y3 is in convex position m K. So 

\Yi\ < L\ \Y2\ < and \Ys\ < M{K,n). 

Since y is a long convex chain, lYil + II2I + |^| ^ 1^1 ^ — b, and so 
+ + M{K, n) > ELn — b. We are going to show that this event has 
small probability. 

We apply Lemma [7.2l again with /x = 6/5. For sufficiently large n the con- 
dition 1920 e^A{K)n < {b/bf is satisfied, since A{K) < 5 = n-'^/'^^/logn 
and (6/5)^ = 7'^n^/^(logn)^/^/125. So we have, just as in (|8.3|) . 

F{M{K,n) > 6/5) < n"'^'^'/^. 

Therefore the estimate (18. 4p applies without change: 

F{L^ + M{K, n) > EL„ - 6) < F{L^ + > EL„ - l.2b) + n"^"^"/^. 

Now Lemma ES implies that EL^ + EL^ < EL„ - 0.52i?f_^^ and just as 
in dSSD, 

P(L1 L2 > EL„ - 1.2 6) < ^ W{V > EL' + 0.26Rf^^ ^ - 0.6 6) 

i=l,2 
1=1,2 

Here the last step is justified just as in ()8.6p except that this time Ri+i > 
Ri = p + 5. Finally, we bound P(L* > EL* -|- 46) the same way as in the 
proof of Lemma 18.31 to obtain 

F{V > EV + 46) < n"^'^"/^. n 

Proof of Theorem II. 3L Considering Proposition 17. H we have to estimate 
the probability that there is a longest convex chain not lying between r_p 
and Tp. This event splits into two parts: either the longest convex chain 
is not long, or there is a long convex chain not between r_p and Tp. The 
probability of the first event is estimated by Theorem II. 2t while the second 
part is handled via Corollaries 18.11 and 18.21 Therefore the probability in 
question is at most 

Remarks. In this proof one can avoid using the estimate on M(K,iJ,). 
In fact, choosing 5 and A small enough, the set K contains more than 6/5 
points of Xn with very small probability. So, with high probability, it cannot 
add much to the size of a long convex chain. There are more events G* and 
G*j, which has a minor effect on the final result. Also, the triangle S in 
Lemma |8. II is to be chosen much smaller. 
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An important step in our proof is Lemma 17.31 essentially implying that 
if the distance between F and the farthest point of a convex chain from T is 
"large" , then the chain cannot be too long. Conditioning on the location of 
this farthest point would allow an elegant conditional expectation argument. 
However, fixing the farthest point modifies the underlying probability space 
and therefore the estimate coming from Lemma 17.31 is no longer valid. To 
eliminate this difficulty, we chose to define finitely many subcases and esti- 
mate them separately, which can also be considered as a finite approximation 
of the continuous conditional expectation. 

9. Numerical experiments 

In the final section we summarize the observations obtained by computer 
simulations. 

The search for the longest convex chains can be accomplished by an al- 
gorithm which has running time O(n^). This algorithm works as follows. 
We order the points by increasing x coordinate, and then recursively create 
a list at each point. The kth element on the list at point p contains the 
minimal slope of the last segment of chains starting at po and ending at p 
whose length is exactly k, and a pointer to the other endpoint of this last 
segment. For creating the list at the next point p, we have to search the 
points before p, and see if p can be added to the chains while preserving 
convexity. 

This algorithm can be speeded up with some (not fully justified but useful) 
tricks. First of all, Theorem 11.31 guarantees that we have to search only 
among the points close to F. The simulations show that most longest convex 
chains are located in a small neighbourhood of F, whose radius is in fact 
of order approximately n~^^^, much smaller than the width of order n~^/^^ 
given by Theorem 11.31 Therefore the search can be restricted to a subset 
of the points with cardinality of order n^/^. Second, when looking for the 
longest chain, we have to search only points relatively close to p, and chains 
which are already relatively long, thus reducing memory demands. 



n 




dn 


Distance/ \/2 


Deviation 


1000 


2.532 


4 


0.270 


1.254 


10000 


2.768 


5 


0.200 


1.383 


15625 


2.813 


5 


0.150 


1.293 


50000 


2.885 


5 


0.100 


1.411 


75000 


2.906 


5 


0.070 


1.580 


100000 


2.917 


5 


0.060 


1.431 


125000 


2.926 


5 


0.050 


1.637 


421875 


2.959 


5 


0.012 


1.732 


1000000 


2.976 


6 


0.012 


2.023 



Table 1. Results obtained by the simulation 



With the above method, the search can be executed for up to 5 • 10^ 
active points, in which case examining one sample takes about 2 minutes. 
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As the experiments show, this provides a good approximation for n's up 
to order 10^. In each experiment, we increased the width of the searched 
neighbourhood until the increment did not generate a significant change in 
the average length of the longest convex chain. The results obtained by this 
method, although giving only a lower bound for EL„, are heuristically close 
to it. q: = 3. 



2.98^ 
2.96 
2.94 
2.92 

2.9 
2. 88 
2.86 
2.84 
2.82 

2.8 
2.78 



Figure 6. Results for n ^/^EL„, illustrated as a function of n^/^. 

Our largest search has been done for n = 10^. The number of samples was 
250 except for the cases n = 25'^ and n = 10^, where we used 500 samples 
in order to model the distribution of L„ (see Figure [7]) . 

The results below well illustrate what the proof of Theorem 11.11 suggests, 
namely, that n~^/^EL„ is increasing with n. Also, the data seem to confirm 
that a = 3. 

On Table 1 we list the results obtained by the program. The first column 
is the number of points chosen in T, the second is the average of n^^^^Ln- 
The third column contains the half-length of the interval of the values of -L„, 
that is, dn = [max|L„ — EL„|J. This is noticeably small even for n = 10^. 
In the fourth column we list ^./^/2 times the radius of the neighbourhood of 
parabola we used for the search (the term ^/2 comes from a transformation 
of coordinates). The last data are the standard deviation of the set of values 
of Lm ie. the square-root of its variance. 
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Figure 7. Distribution of L„, 500 samples, n = 25'^ and 



n 



10^. 



LONGEST CONVEX CHAINS 



25 



Figure [6] illustrates the linear interpolation of n^^/'^EL„ as a function of 
n^/^. It is based on the data shown on Table 1. 

As we know from Theorem II. 2 1 L„ is highly concentrated near its expec- 
tation. This phenomenon is well recognizable on Figure [3 where we plot the 
distribution in the cases n = 253(= 15625) and n = 10^ with 500 samples. 
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